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METHODS OF MODIFYING THE PRODUCTION OF ISOPENTENYL 
PYROPHOSPHATE, DIMETHYLALLYL PYROPHOSPHATE AND/OR 

ISOPRENOIDS 



BACKS 



5 finM of the Invention 

The present invention is directed to genes encoding deoxyxylulose-5- 
phosphate synthase (dxps), glyceraldehyde-3-phosphate dehydrogenase (gapd), 
and the lytB gene product, as well as vectors containing the same and hosts 
transformed with said vectors. The present invention also provides methods for 
10 modifying the production of isopentenyl pyrophosphate (IPP) and/or dimethylallyl 
pyrophosphate (DMAPP) and/or an isoprenoid compound (i.e., a compound derived 
from IPP and/or DMAPP). Additionally, the present invention provides a method 
for screening for procaryotic and eukaryotic genes encoding enzymes that 
participate in the nonmevalonate pathway leading to IPP and DMAPP. 



15 Backs 



A plethora of chemical compounds are produced by what are collectively 
known as the pathways of isoprenoid (a.k.a. terpene) biosynthesis. The one feature 
in common to the many isoprenoids (more than 20,000 have been identified in 
plants; Chappell. 1 995) is their biosynthesis from the central metabolites and 

20 building blocks for all isoprenoid compounds: the 5 carbon compounds isopentenyl 
pyrophosphate (IPP) and its allylic isomer dimethylallyl pyrophosphate (DMAPP). 
The interconversion of IPP and DMAPP is reversible, and is catalyzed by the 
enzyme IPP isomerase (IPI). Using the IPP and DMAPP building blocks in various 
combinations, a modular assembly process that produces compounds of 5, 10, 15, 

25 20 or more carbons (in multiples of 5) allows the biosynthesis of the basic skeletons 
for the many and various isoprenoids with a relatively small number of basic 
reaction steps. The skeleton of plant carotenoid pigments, for instance, is 
assembled from two molecules of a C a compound, geranylgeranyl pyrophosphate 
(GGPP), that is itself assembled from 3 units of IPP and 1 unit of DMAPP, and that 

30 also serves as a precursor for many other branches of the isoprenoid pathway in 
plants. 

Isoprenoids are formed from IPP and DMAPP in at least three different 

1 
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compartments of plants cells: the cytosol/endoplasmic reticulum, the mitochondria 
(and/or Golgi apparatus), and the plastids (McGarvey and Croteau. 1995). The 
source of IPP and DMAPP for isoprenoid biosynthesis in these different 
compartments has long been a matter of some controversy and debate (see Bach, 
5 1995; McGarvey and Croteau, 1995). The well-known "classical" or 
acetate/mevalonate route to IPP and DMAPP proceeds from acetyl-CoA via 3- 
hydroxy-3-methylglutaryl-CoA (HMG-CoA) and mevalonic acid (MVA). The critical, 
rate-determining and irreversible step in this route in animals is the reduction of 
HMG-CoA to produce MVA. catalyzed by the enzyme HMG-CoA reductase 
10 (HMGR; EC 1 .1.1.34; reviewed in Goldstein and Brown, 1990). This enzyme also 
appears to provide an important control point for substrate flow into certain 
isoprenoids in plants (Bach. 1986; Chappell et al.. 1995; Schaller et al., 1995; 
Stermer et al., 1994). HMGR has been localized in the cytosol of plants, 
associated with membranes of the endoplasmic reticulum (reviewed in Bach, 1995). 
15 There is no convincing experimental evidence that this key enzyme also resides in 
the plastids or mitochondria of plants. An uptake by plastids of IPP to support 
isoprenoid biosynthesis in this organelle has been suggested in the past (Kreuz and 
Kleinig. 1984). However, it has been difficult to reconcile a cytosolic source of IPP 
for plastid isoprenoids with the effective inhibition of cytoplasmically-made 
20 isoprenoids (e.g. sterols) but not of plastid-made isoprenoids (e.g. carotenoids) or 
mitochondrial ones (quinones) by mevinolin, a specific inhibitor of HMG-CoA 
reductase (discussed in Bach, 1995 and McGarvey and Croteau. 1995). 
Furthermore, the demonstrated ability of isolated chloroplasts of Acetabularia 
(Moore and Sheperd, 1977) and spinach (Schulze-Seibert and Schultz, 1987) to 
25 synthesize carotenojds and other isoprenoids from 14 C0 2 necessarily implies a 
chloroplast pathway for IPP biosynthesis. Further confusion arose from reports that 
radiolabelled acetate is efficiently incorporated by isolated chloroplasts into fatty 
acids but not into carotenoids (see Kleinig, 1989; Liedvogel, 1986; Schulze-Seibert 
and Schultz, 1987). a puzzling observation that has been explained by invoking a 
30 metabolic "channeling" that somehow renders the newly-formed acetyl-CoA pool 
available for fatty acid biosynthesis but unavailable for IPP biosynthesis (Kleinig. 
1989). 

2 
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An attractive explanation for these many puzzling and apparently 
contradictory observations is provided by the suggestion of an "alternative" or non- 
mevalonate pathway for the biosynthesis of IPP and/or DMAPP in the chloroplasts 
of plants (Lichtenthaler et al.. 1997) and in algae (Schwender et al., 1996). 

5 Existence of a distinct non-mevalonate biochemical pathway to IPP and/or DMAPP, 
using pyruvate rather than acetate as a substrate, was posited for certain bacteria, 
including Escherichia coli, as early as 1981 (Pandian et ah, 1981), but this 
alternative pathway in bacteria received little attention until relatively recently (Zhou 
and White, 1991; Rohmer et al., 1993 and Horbach et al., 1993; see Bach, 1995 

10 for a critical discussion of this topic). It appears that many bacteria do not have the 
acetate/mevalonate pathway for biosynthesis of IPP. For instance, a BLAST 
search of the E. coli genome, the entire sequence for which has now been 
deposited in GenBank, does not identify any open reading frames with significant 
similarity to known bacterial, plant or mammalian HMG-CoA reductases. Nor can 

1 5 we discern an HMG-CoA reductase homologue in the genome sequence data base 
of the cyanobacterium Synechococystis 6803. Because ancestors of the 
cyanobacteria are the presumptive progenitors of plant chloroplasts, the non- 
mevalonate route to IPP in these photosynthetic procaryotes is likely to resemble 

that in plant chloroplasts. 

20 Recent reports indicate that plant and algal chloroplast isoprenoids, including 

carotenoids (Lichtenthaler et al., 1997; Schwender et al., 1996). and the medically- 
important isoprenoid paclitaxel (a.k.a. TAXOL®) (Eisenreich et al., 1996) are made 
from IPP supplied via a non-mevalonate pathway. Whether this pathway supplies 
IPP for carotenoids and other isoprenoids manufactured in the various 

25 nonphotosynthetic plant plastids (e.g. chromoplasts, leucoplasts, amyloploasts) or 
in etioplasts or developing chloroplasts is an open question. Several recent reports 
describe a thiamin-dependent enzyme from E. coli (Sprenger et al., 1997; Lois ef 
al., 1998) and from peppermint (Lange ef al., 1998) that converts pyruvate and 
glyceraldehyde-3-phosphate (GAP) to deoxyxylulose-5-phosphate (DXP). A 

30 homologue of this enzyme was earlier described for Arabidopsis thaliana (Mandel 
ef al., 1996). but the function of the A. thaliana polypeptide had not been 
ascertained. 
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The evidence is compelling that DXP is a substrate in the pathway leading 
to IPP or DMAPP in plants, algae, and certain bacteria including E. coli (Arigoni et 
al., 1997). However, DXP also is a substrate leading to other essential compounds 
in these organisms, including thiamin and pyridoxal (see Lois et al., 1998). 

5 Therefore the biochemical reaction catalyzed by the DXP synthase enzyme is not 
dedicated or restricted to providing substrate for the pathway or pathways leading 
to IPP and/or DMAPP, and isoprenoids derived therefrom. Because the product 
of the reaction is shared with several other pathways, the DXP synthase enzyme 
would not seem to be an obvious candidate for a controlling step in the pathway 

10 leading to isoprenoids. An ability to modify isoprenoid production, enhancing 
production of desired compounds or reducing that of undesirable compounds, 
would be quite advantageous in many applications. 

Plausible routes to IPP from pyruvate and GAP via DXP have been 
suggested for plants and bacteria (Schwender et al, 1996; Rohmer et al., 1993 and 

15 1996; see Bach, 1995). A gene encoding what may be the second enzyme in the 
pathway, DXP reductoisomerase, has recently been identified in Escherichia coli 
(Takahashi et al. f 1998). The precise nature and sequence of subsequent 
biochemical reactions in the nonmevalonate pathway, and whether regulation of 
flux through this pathway occurs primarily at a single controlling step, as with HMG- 

20 CoA reductase for the mevalonate pathway, are so far unknown. We have taken 
a genetic approach to delineate the pathway leading to IPP and DMAPP in £. coli 
and plants. 

We earlier made the surprising observation that the reversible 
interconversion of IPP and DMAPP limits the accumulation of carotenoids in E. coli 

25 (an observation alluded to in Sun et al., 1996 and Sun et aL, 1998). Others 
(Kajiwara et al., 1997) have made a similar observation. As a direct result of 
continuing to pursue the same empirical approach (using a colored isoprenoid 
compound to "report" on the amount of IPP and DMAPP available for isoprenoid 
biosynthesis) which revealed that the isomerization of IPP is a limiting step in 

30 isoprenoid biosynthesis (see U.S. Patent No. 5,744,341 ), we have now identified 
several other genes which can be used to modify IPP, DMAPP and/or isoprenoid 
production in host cells. 

4 
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For two of the these genes the functions of the products, gIyceraldehyde-3- 
phosphate dehydrogenase and DXP synthase, are well known. The role of the 
product of the third, the lytB gene, is still obscure. Because certain mutations in 
this gene in Escherichia coli confer an ability to tolerate elevated levels of penicillin 
5 (Harkness et al., 1 992; Gustafson et al., 1 993) the lytB gene product would seem 
to play some role in or otherwise influence the biosynthesis of peptidoglycan. It has 
been suggested that the lytB gene product is not an enzyme directly involved in the 
pathway leading to peptidoglycan but rather exerts some control on the production 
of the global regulator molecule guanosine 3\5'-bisphosphate (ppGpp) (Gustafson 
10 et aL, 1993; Rodionov and Ishiguro, 1995). The evidence in support of this 
suggestion is correlative and indirect and therefore by no means convincing. 
Because lytB influences the accumulation of certain isoprenoids in £. coli (see 
below), because an analysis of the nearly twenty completed bacterial genomes 
reveals it to be one of the very few genes found only in those bacteria that also 
15 contain genes encoding DXP synthase and DXP reductoisomerase, and because 
we have isolated an apparently chloroplast-targeted lytB cDNA from a eucaryotic 
plant, we speculate that lytB may encode an enzyme that catalyzes one of the later 
reactions in the nonmevalonate pathway leading to IPP and DMAPP in certain 
bacteria and in plant chloroplasts. To avoid confusion, it is important to note that 
20 another gene, which is completely unrelated to the gene under consideration here, 
has unfortunately also been given the name lytB (see, e.g., Garcia et al., 1999). 

There remains a need for methods to enhance the production of desirable 
isoprenoids and reduce the accumulation of undesirable isoprenoids in plants, fungi 
and bacteria that contain a nonmevalonate pathway. There also remains a need 
25 in the art for methods for screening for procaryotic and eukaryotic genes encoding 
enzymes of isoprenoid biosynthesis and metabolism. 

SI IMMARY OF THF INIVFNTION 

A subject of the present invention is an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
30 acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 

5 
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85% identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
5 acid sequence encodes a protein which has an amino acid sequence which is at 

least 85% identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
1 0 sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. 

The inventionalso includes vectors which comprise any of the nucleic acid 
sequences listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of enhancing the 
15 production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having DXP synthase, GAP 
dehydrogenase and/or LYTB activity, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic 
20 acid sequence, thereby producing the protein. 

Yet another subject of the present invention is a method of modifying the 
production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies DXP synthase, GAP 
25 dehydrogenase and/or LYTB activity in the host cell, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence, thereby producing the protein. 

The present invention also includes a method of expressing, in a host cell, 
a heterologous nucleic acid sequence which encodes for a protein having DXP 
30 synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 



<WO __99S684SA1 I > 



WO 99/58649 PCT/US99/07041 



promoter, and expressing the heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous 

nucleic acid sequence which encodes for a protein which modifies DXP synthase. 

GAP dehydrogenase and/or LYTB activity in the host cell, the method comprising 
5 inserting into the host cell a vector comprising the heterologous nucleic acid 

sequence, wherein the heterologous nucleic acid sequence is operably linked to a 

promoter, and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is to provide a method for screening 

for eukaryotic genes which encode enzymes involved in isoprenoid biosynthesis 
10 and metabolism. 

RRIFF nFR^R IPTinM DF THF DRAWINGS 

A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better 
understood by reference to the following detailed description when considered in 
15 connection with the accompanying drawings, wherein: 

Figure 1 is an illustration showing that IPP and DMAPP serve as the central, 
metabolites leading to an immense variety of different isoprenoid compounds in 
plants, bacteria, and other organisms. The mevalonic acid pathway via acetyl-CoA 
in the cytosol of plants and in animals is well characterized. A second route from 

20 pyruvate and GAP, not yet elucidated, is thought to operate in the chloroplasts of 
plants and algae, in cyanobacteria, and in many bacteria. Later reaction steps in 
the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
because it has not been established which of these two is the initial product of this 

25 pathway. Abbreviations: G3P, glyceraldehyde-3-phosphate; G3PD, GAP 
dehydrogenase; DXPS, DXP synthase; IPP. isopentenyl pyrophosphate; IPI, IPP 
isomerase; DMAPP. dimethylallyl pyrophosphate; FPP, farnesyl pyrophosphate; 
GPP geranyl pyrophosphate; GGPP. geranylgeranyl pyrophosphate. 

Figure 2 illustrates a route to IPP/DMAPP from pyruvate and glyceraldehyde-3- 
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phosphate (GAP), which is thought to operate in the chloroplasts of plants and 
algae, in cyanobacteria, and in many bacteria including E. coll Later reaction steps 
in the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 

5 because it has not been established which of these two is the initial product of this 
pathway. Enzymes of interest are shown in bold white text in a black box. 
Abbreviations: DMAPP, dimethylallyl pyrophosphate; DXPS, deoxyxyIulose-5- 
phosphate synthase; DXPR, deoxyxylulose-5-phosphate reductoisomerase; FPS, 
farnesyl pyrophosphate synthase, GAPD, glyceraldehyde-3-phosphate 

10 dehydrogenase; GGPP, geranylgeranyl pyrophosphate; IPI, isopentenyl 
pyrophosphate isomerase; IPP, isopentenyl pyrophosphate. 

Figure 3 illustrates that the carotenoid phytoene is derived by a head-to-head 
condensation of two molecules of the C M GGPP compound, which itself is 
assembled from 3 molecules of IPP and 1 molecule of DMAPP. Abbreviations: 
15 FPP, farnesyl pyrophosphate; GPP geranyl pyrophosphate; PPPP, prephytoene 
pyrophosphate. 

Figure 4 is a schematic illustration and restriction map of plasmid pAC-LYC 
(Cunningham et al., 1994) which contains genes (crtE, crtB, and crtl) of Erwinia 
herbicola encoding all of the enzymes required for production of the pink-colored 
20 isoprenoid pigment lycopene from the colorless IPP and DMAPP compounds. Cm, 
chloramphenicol resistance gene. 

Figure 5A is a cDNA sequence and Figure 5B is the predicted amino acid sequence 
of a putative DXP synthase isolated from a flower cDNA library of Tagetes erecta 
(SEQ ID NOS: 1 and 2). The cDNA is incorporated into the plasmid pMarDXPS. 

25 Figure 6A is a cDNA sequence and Figure 6B is the predicted amino acid sequence 
of a chloroplast isoform of GAP dehydrogenase isolated from Arabidopsis thaliana 
(SEQ ID NOS: 3 and 4). The cDNA is incorporated into the plasmid pAtG3PD. 

8 
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Figure 7A is a cDNA sequence and Figure 7B is the predicted amino acid sequence 
of a LYTB protein isolated from an Adonis palaestina flower cDNA library (SEQ ID 
NOS: 5 and 6). The sequence is of clone Ipi3 except that the 14 bp at the N- 
terminus were obtained from the slightly longer cDNA clone Ipi18. The cDNA is 
5 incorporated into the plasmid pApLYTB. 

Figure 8 is an alignment of the predicted amino acid sequences of LYTB from 
Adonis palaestina, Synechocystis PCC6803 and E coli. The N-terminal extension 
of the Adonis polypeptide, relative to that of Synechocystis PCC6803, is predicted 
by the program ChloroP (Emanuelsson et al., 1999) to constitute a chloroplast 
10 transit peptide, serving to target the polypeptide to this organelle in plants. Black 
boxes with white letters are used where all three of the aligned residues are 
identical. Grey boxes with black letters are used where two of the three aligned 
residues are identical. Abbreviations: Ap, Adonis palaestina; Sy, Synechocystis 
PCC6803; Ec, Escherichia coli. 

15 Figure 9 is a schematic representation of the mapping of an E. coli genomic 
fragment to ascertain which of the genes in this fragment will enhance or impair 
lycopene accumulation in E. coli. Deletion mapping of an E. coli genomic fragment 
(the insert in plasmid pEc3.9 is essentially identical to GenBank U32768: 
4879..8819) with genes encoding DXP synthase (orf620), FPP synthase (ispA) and 

20 the small subunit of exonuclease VII (xseB). Numbers to the right indicate relative 
lycopene accumulation per mL of liquid culture with plasmids in lycopene- 
accumulating E. coli strain TOP10. The insert in pEc3.9 is oriented in the forward 
direction in the multicopy plasmid vector pBluescript SK-. The EcoR\ and Sma\ 
sites in the vector preceding the genomic fragment and a Kpn\ site following it were 

25 used, along with Mtel, Smal, and Sa/I sites in the genomic DNA, to construct the 
deletion subclones illustrated. Incomplete genes and open reading frames are not 
shown. 

nF8rR| p T ir>N r>F TH P PRFFFRRFO FMROOIMFNTS 

The term "isoprenoid" is intended to mean any member of the class of 
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naturally occurring compounds whose carbon skeletons are composed, in part or 
entirely of isopentyl C s units. Preferably, the carbon skeleton is of an essential oil. 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A search of the public databases (GenBank) for genes encoding 
homologues of the E. coli lytB gene product unearthed such genes in twelve 
bacteria and in the cyanobacterium Synechocystis PCC6803. A recent publication 
(Potter et al.. 1998) discusses the distribution of this gene in various bacteria. Our 
description herein of a homologue in the plant Adonis palaestina reveals for the first 
time that this gene is present and expressed in a eucaryotic organism. A recently 
deposited genomic DNA sequence for the green plant Arabidopsis thaliana 
(GenBank accession number AL035521 ) contains what appears to be a gene (the 
probable coding sequence is interrupted by several apparent introns) encoding 
LYTB in this organism. The predicted sequence of this Arabidopsis LYTB is 
somewhat uncertain (a comparison of the Adonis sequence with that listed given 
in AL035521 for Arabidopsis suggests that several of the exon-intron junctions 
predicted for this gene in the GenBank record are incorrect), but sequence identity 
in a comparison with the Adonis sequence is ca. 80% or more. Partial cDNA 
sequences in the data base of expressed sequence tags (dbEST) that predict 
peptides with sequence similarity to portions of the Adonis LYTB sequence indicate 
that homologues exist and mRNAs encoding LYTB are produced in several other 
plant species including rice (D45948), loblolly pine (AA556723) and soybean 
(AI437981). Both the Adonis and Arabidopsis predicted amino acid sequences are 
more than 60% identical to that predicted by the cyanobacterium Synechocystis 
PCC6803 gene, and the two plant and the cyanobacterial sequences are more than 
30% identical to the predicted E. coli gene product. An alignment of the Adonis. 
Synechocystis PCC6803 and E. coli predicted amino acid sequences is shown in 
Figure 7. A number of regions and residues conserved in LYTB are indicated in 
this Figure. 

The term "LYTB activity" is intended to mean the ability of LYTB to affect the 
production of IPP, DMAPP and/or isoprenoids in a host cell containing the lytB 
gene or DNA copy of the lytB mRNA. It has not yet been confirmed that the LYTB 
protein is, in fact, an enzyme. The precise role of the LYTB protein in affecting the 

10 
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production of isoprenoids has not been established. However, whatever the 
mechanism of action of the lytB gene product, we intend to cover that mechanism 
of action by the term "LYTB activity". 

The present invention is directed to an isolated nucleic acid sequence which 
5 encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 2. Preferably, the nucleic acid sequence is at least 
90%. at least 95% or completely identical to SEQ ID NO: 1 . or the nucleic acid 
10 sequence encodes a protein which has an amino.acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
15 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. Preferably, the nucleic acid sequence is at 
least 90%. at least 95% or completely identical to SEQ ID NO: 3. or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%. at least 95% or completely identical to SEQ ID NO: 4. 
20 A further subject of the present invention is an isolated nucleic acid 

sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. Preferably, the nucleic acid sequence is at least 
25 90%. at least 95% or completely identical to SEQ ID NO: 5, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 6. 

In each case, sequence similarity is measured using sequence analysis 
software, for example, the Sequence Analysis software package of the Genetics 
30 Computer Group (University of Wisconsin Biotechnology Center, 1710 University 
Avenue, Madison, Wisconsin 53705). MEGAIign (DNAStar. Inc., 1228 S. Park St. 
Madison, Wisconsin 53715). or MacVector (Oxford Molecular Group. 2105 S. 
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Bascom Avenue, Suite 200, Campbell. California 95008). Such software matches 
similar sequences by assigning degrees of identity to various substitutions, 
deletions, and other modifications. Conservative (i.e. similar) substitutions typically 
include substitutions within the following groups: glycine and alanine; valine. 
5 isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; 
serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle. J. Mol. Biol. 157: 105-132 (1982)), or on the 
basis of the ability to assume similar polypeptide secondary structure (see Chou 
10 and Fasman, Adv. Enzymol. 47: 45-1 48 (1 978)). If comparison is made between 
nucleotide sequences, preferably the length of comparison sequences is at least 
50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or 
at least 100 nucleotides. It is most preferred if comparison is made between the 
nucleic acid sequences encoding the protein coding regions necessary for protein 
15 activity. If comparison is made between amino acid sequences, preferably the 
length of comparison is at least 20 amino acids, more preferably at least 30 amino 
acids, at least 40 amino acids or at least 50 amino acids. It is most preferred if 
comparison is made between the amino acid sequences in the protein coding 
regions necessary for protein activity. 
20 The present inventors have isolated eukaryotic genes encoding DXP 

synthase from Tagetes erecta (marigold), GAP dehydrogenase from Arabldopsis 
thaliana, and LYTB from Adonis palaestina. All were identified on the basis of an 
enhancement of lycopene accumulation in E. coil The £ coli DXP synthase was 

also identified in this same way. 

25 Suitable vectors according to the present invention comprise a gene 

encoding one or more of the above-identified enzymes involved in IPP. DMAPP 
and/or isoprenoid biosynthesis or metabolism, wherein the gene is operably linked 
to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et al., Molecular 

30 Clonin g A Labora tor y Manual . Cold Spring Harbor Laboratory. Cold Spring Harbor, 
NY. 1989; Ausubel et al.. finrr»"« PmtonoK in Mnlmitar Biology. Greene 
Publishing and Wiley Interscience. New York. 1991). Suitable vectors for 
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eukaryotic expression in plants are described in Frey et al.. Plant J. (1995) 8(5):693 
and Misawa et al, 1994a. Suitable vectors for prokaryotic expression include 
pACYC184, pUC1 19, and pBR322 (available from New England BioLabs, Bevery, 
MA) and pTrcHis (Invitrogen) and pET28 (Novagen) and derivatives thereof. The 
vectors of the present invention can additionally contain regulatory elements such 
as promoters, repressors, selectable markers such as antibiotic resistance genes, 
etc., the construction of which is very well known in the art. 

One or more of the genes encoding the enzymes as described above, when 
cloned alone or in combination into a suitable expression vector, can be used to 
overexpress these enzymes in a plant expression system or to inhibit the 
expression of these enzymes. For example, a vector containing one or more of the 
genes of the invention may be used to increase the amount of isoprenoids in an 
organism and thereby alter the nutritional or commercial value or pharmacology of 
the organism. 

15 Therefore, the present invention includes a method of enhancing the 

production of IPP, DMAPP and/or an isoprenoid in a host cell, relative to an 
untransformed host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein 
having DXP synthase, GAP dehydrogenase and/or LYTB activity, wherein the 

20 heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention also includes a method of modifying the production of IPP, 
DMAPP and/or an isoprenoid in a host cell, the method comprising inserting into 
the host cell a vector comprising a heterologous nucleic acid sequence which 

25 encodes for a protein which modifies DXP synthase, GAP dehydrogenase and/or 
LYTB activity in the host cell, relative to an untransformed host cell, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention further includes a method of expressing, in a host cell, a 

30 heterologous nucleic acid sequence which encodes for a protein having DXP 
synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
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sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter, and expressing the heterologous nucleic acid sequence. 

The invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein which modifies 
5 DXP synthase, GAP dehydrogenase and/or LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 
10 Preferably, the isoprenoid comprises a compound derived from at least one 

member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). By the 
term "derived from", we mean that the isoprenoid contains at least one of the listed 
compounds as a building block. It is also preferable if the isoprenoid comprises at 
15 least one member selected from the group consisting of a diterpene, a carotenoid, 
an essential oil. a fragrance, an isoprene, a cytokinin. a rubber, a quinone, a sterol, 
a hopanoid, a triterpene. a steroid, a prenylated protein, a phytoalexin, a gibberellin, 
a tocopherol, a dolichol, a chlorophyll and a therapeutic compound. 

It is most preferred if the isoprenoid comprises at least one member selected 
20 from the group consisting of an essential oil, a fragrance, a rubber, a carotenoid 
and a therapeutic compound, such as paclitaxel. 

The heterologous nucleic acid sequence may originate from a eukaryotic or 
procaryotic cell. By the term "originate from", we intend to mean that the sequence 
information for the heterologous nucleic acid came from the eukaryotic or the 
25 procaryotic cell. However, the specific nucleic acid itself does not have to be from 
the organism. The nucleic acid may come from the organism, or it may be 
synthetically produced using recombinant nucleic acid techniques known in the art. 

Preferably, the heterologous nucleic acid sequence comprises a nucleotide 
sequence for dxps, gapd and/or lytB. It is most preferred that the heterologous 
30 nucleic acid sequence comprises a nucleotide sequence which encodes a dxps. 
gapd and/or a lytB gene and is at least 85% identical, preferably at least 90%. at 
least 95% or completely identical, to SEQ ID NO: 1 . 3 and/or 5. respectively, or the 
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15 



20 



25 



30 



nucleic acid sequence encodes a protein which has an amino acid sequence which 
is at least 90%, at least 95% or completely identical to SEQ ID NO: 2, 4 and/or 6, 
respectively. Identity is determined as noted above. 

The term "modifying the production" in the methods of the invention means 
that the amount of target compounds produced (e.g., IPP, DMAPP and/or 
isoprenoids) can be enhanced or reduced, as compared to an untransformed host 
cell. Thus, in accordance with an embodiment of the present invention, the 
production or the biochemical activity of the target compounds (or the enzymes 
which catalyze their formation) may be reduced or inhibited by a number of different 
approaches available to those skilled in the art. including but not limited to such 
methodologies or approaches as anti-sense (e.g.. Gray et al.. 1992), ribozymes 
(e.g.. Wegener et al., 1994), co-suppression (e.g. Fray and Grierson. 1993). 
targeted disruption of the gene (e.g.. Schaefer et al.. 1997), intracellular antibodies 
(e.g., see Rondon and Marasco, 1997) or whatever other approaches rely on the 
knowledge or availability of the nucleic acid sequences of the invention, or the 

proteins encoded thereby. 

Host systems according to the present invention can comprise any organism 
that utilizes a nonmevalonate (i.e., via DXP) pathway for production of IPP and/or 
DMAPP. Organisms which produce isoprenoids using IPP and/or DMAPP derived 
from a nonmevalonate pathway include plants, algae, certain bacteria, 
cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard 
techniques. See, for example. Sambrook et al., Mo l ftcn l ar Ginning A I a horatory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor. NY, 1989; Ausubel 
et al.. Current Prntn^k in Macu lar Rinlogy. Greene Publishing and Wiley 

Interscience, New York. 1991. 

Alternatively, transgenic organisms can be constructed which include the 
nucleic acid sequences of the present invention. The incorporation of these 
sequences can allow the controlling of isoprenoid biosynthesis, content, or 
composition in the host cell. These transgenic systems can be constructed to 
incorporate sequences which allow for the overexpression of the various nucleic 
acid sequences of the present invention. Transgenic systems can also be 
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constructed which allow for the underexpression of the various nucleic acid 
sequences of the present invention. Such systems may contain anti-sense 
expression of the nucleic acid sequences of the present invention. Such anti-sense 
expression would result in the accumulation of the substrates of the enzyme 

encoded by the sense strand. 

Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for 
purposes of illustration only and are not intended to be limiting unless otherwise 
specified. 

FXAMPI F 1 



ial gpnfis that p nhannR r.amtennifl accumulation 



zherichia coli 

The plasmid pAC-LYC (Cunningham et al., 1994 and 1996) contains genes 
encoding all of the enzymes required for the formation of lycopene from IPP and 
DMAPP (see Figure 3). Cells of Eco// containing the plasmid pAC-LYC accumulate 
the carotenoid lycopene and thereby form colonies on solid growth medium that are 
pink in color (Cunningham et al, 1994). When cDNA or genomic libraries of 
plants, cyanobacteria, or bacteria are introduced into the lycopene-accumulating 
E. coli strain (the complete methodology is contained in Cunningham ef al., 1994 
and 1 996). a rare few of the colonies formed after spreading of the cell culture onto 
solid growth medium are much deeper in color than is typical of the vast majority 
of the colonies. The enhancement of color and concomitant carotenoid pigment 
accumulation in £. co//is prima facie evidence for an enhancement of flux into and 
through the pathway or pathways leading to the immediate precursors of 
carotenoids and all other isoprenoid compounds: IPP and its allylic isomer DMAPP. 
Genes encoding subsequent enzymes in the pathway leading to lycopene are 
already present in the multicopy plasmid pAC-LYC (see Figure 3). 

Among the cDNAs and genes isolated with this screening methodology are: 
Tagetes erecta (marigold) cDNAs encoding a homologue of DXP synthase (the 
DNA sequence of the longest is given in Figure 4), an E.coli genomic clone 
containing the DXP synthase (and no other complete open reading: a genomic 
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fragment from the Sph\ site which lies at base 5210 in the sequence listed under 
GenBank U32768 to the A/del site at base 8003 in this GenBank sequence was 
found to enhance carotenoid accumulation in E. coli. This .region encompasses the 
open reading frame earlier referred to as f620, and now as the DXP synthase), 

5 Arabidopsis thaliana cDNAs encoding a chloroplast isoform (Shih et a/., 1991 and 
1992) of the enzyme GAP dehydrogenase (the sequence of the longest of these 
isolated cDNAs is given in Figure 5),and an Adonis palaestina cDNA encoding a 
homologue of the E. coli lytB gene product (Figure 6). The enhancement of 
isoprenoid biosynthesis in E. coli (as indicated by lycopene accumulation) when 

10 cDNAs or genes encoding DXP synthase or GAP dehydrogenase are introduced 
can be understood in the context of the biochemical pathway that has been 
postulated for production of IPP/DMAPP in E. coli (and in other bacteria, 
cyanobacteria and plants). However, at the date of the provisional filing of this 
application, there had been no indications that increased flux through the pathway 

15 to IPP/DMAPP might be obtained by increasing the expression of either of these 
enzymes. A publication after the priority date of this application (Harker and 
Bramley (1999)) shows a salutary effect on isoprenoid accumulation in E. coli in 
which foreign DXP synthase genes are introduced. 

The mechanism of enhancement of isoprenoid production by introduction of 

20 cDNAs encoding LYTB is not yet known. The nonmevalonate pathway leading to 
IPP and/or DMAPP has not yet been elucidated and these cDNAs may encode an 
enzyme subsequent to DXP synthase in the pathway. Alternatively, the 
enhancement of isoprenoid accumulation may involve a mechanism less direct 
(e.g., as for GAP dehydrogenase, involvement in biochemical reactions that utilize 

25 or supply the substrates GAP and pyruvate, or by exerting a regulatory influence 
on isoprenoid pathways). 

FVAMPI E 2 

Amhidnnsi* thaliana Tan*** emnta and Adonin nalaeitina cDNA I ihmries 



A size-fractionated 2-3 kB cDNA library of A. thaliana in lambda ZAPII 
30 (Kieber et al., 1 993) was obtained from the Arabidopsis Biological Resource Center 
at The Ohio State University (stock number CD4-1 5). Other size fractionated 
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libraries were also obtained (stock numbers CD4-13. CD4-14, and CD4-16). The 
cDNA libraries for Tagetes erecfa (marigold) and Adonis palaestina (pheasant's 
eye) were constructed for us by Stratagene using cDNAs isolated from flower 
tissues. An aliquot of each library was treated to cause a mass excision of the 
5 cDNAs and thereby produce a phagemid library according to the instructions 
provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of 
£ coli TOP1 0 F (this strain contained the plasmid pAC-LYC) by incubation of the 
1 0 phagemid with the £. coli cells for 1 5 min at 37°C. Cells had been grown overnight 
at 30°C in LB medium supplemented with 2% (w/v) maltose and 10 mM MgS0 4 
(final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on 
an Eppendorf microfuge (541 5C) for 10 min. The pellets were resuspended in 10 
mM MgS0 4 to a volume equal to one-half that of the initial culture volume. 
15 Transformants were spread on large (150 mm diameter) LB agar petri plates 
containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming 
units were spread on each plate. Petri plates were incubated at room temperature 
for 2 to 7 days to allow maximum color development. Plates were screened visually 
20 with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare deep pink colonies that could be observed in the 
background of paler pink colonies. 



FXAMPI 

Fnhancement nfCarnten ^ ArmmnMion in F coli bVCDNAS and ffftflf?, ' ? encoding 

25 f vtr dxp syntha** ipp fen mpm . sft and GA P Dehydrogenase Individual l y and in 

Combination. 

Attempts were made to maximize production of three very different 
carotenoids in £. coli engineered to accumulate these isoprenoid pigments. The 
specific carotenoids chosen were the linear and lipophilic pink compound 
30 lycqpene, the bicyclic hydrocarbon p-carotene and the much more polar dihydroxy, 

.- x -j .u:_ rwD cwnthaeo IPP isnmerase. LYTB. and GAP 
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dehydrogenase had been observed to enhance the pigmentation of E. coli colonies 
growing on solid agar medium. For a more quantitative appraisal of the influence 
of these genes and cDNAs on carotenoid accumulation, the amounts of carotenoid 
pigments in liquid cultures were examined with the results given in Figure 1 . The 

5 marigold DXP synthase, an Arabidopsis IPP isomerase and the Adonis LYTB 
cDNAs gave rise to significant and substantial increases in the amount of 
carotenoids accumulated per volume of culture. A partially-sequenced Arabidopsis 
cDNA (GenBank AA605545) with sequence similarity to the Adonis lytB cDNA gave 
results similar to that of the Adonis cDNA (not shown). A synechocystis PCC6803 

10 lytB gene was less effective but gave a significant enhancement as well. See Table 
1. GAP dehydrogenase was slightly detrimental to pigment accumlation in liquid 
culture, in contrast to earlier observed enhancement for cultures grown on solid 
media. The influence of this gene on pigment accumulation may depend on the 
specific growth regimen. A similar result, enhancement for cultures on solid media 

15 but reduction in liquid culture, was also obtained for an £. coli genomic fragment 
containg DXP synthase (see Example 4 below and Figure 8). A combination of 
DXP synthase with IPP isomerase was significantly more effective at enhancing 
pigment accumulation than was either of the individual cDNAs. 

FXAMPI E 4 

20 Fnhancement *nri Rerintiinn of I vconene Accumulation in F coli containing 

multiple copies of gene* enc o ring ™P synthase and Farnesvl I 

synthase, respectively 

A number of the inserts in plasmids obtained from dark pink colonies 
selected in screens of the E. coli genomic library (see example 1) contained the 

25 gene encoding DXP synthase in this organism (see Sprenger et al., 1997; Lois et 
al., 1998). Deletion mapping of one such £. coli genomic fragment indicated that 
the dark pink colony phenotype and enhanced carotenoid accumulation in liquid 
culture was conferred by a fragment containing the DXP synthase and no other 
complete open reading (Figure 8). A genomic fragment containing the ispA and 

30 xseB genes immediately upstream of dxps was, in contrast, quite inhibitory for 
carotenoid accumulation (Figure 8). This diminution in pigmentation results, we 
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surmise, from the activity of the farnesyl pyrophosphate (FPP) synthase encoded 
by ispA (Fujisaki et al. t 1990). The FPP synthase would be expected to compete 
with geranylgeranyl pyrophosphate (GGPP) synthase (Figure 1) for IPP and 
DMAPP and thereby deplete the substrate available for carotenoid biosynthesis. 

Table 1 illustrates the influence of cDNAs encoding DXP synthase, GAP 
dehydrogenase, LYTB, and IPP isomerase, individually and in combination, on the 
accumulation of lycppene, p-carotene, or zeaxanthin in three strains of E. coli 
engineered to produce these isoprenoid compounds. 



Table 1 . Enhancement of carotenoid accumulation in lycopene, fa- 
carotene, and zeaxanthin-accumulating strains of E. coli by introduction of 
plant cDNAs and cyanobacterial genes individually and in combination. 



Plasmid 


cDN A or 
Gene 
Product 


Lycopene 
% of control 


b-Carotene 
% of control 


Zeaxanthin 
% of control 


pBluescript SK- 


Control (empty 
vector) 


(100) 


(100) 


(100) 


I pAtipiTrc 


Arabidopsis 
thaliana IPI 


202112 


22713 


22314 


pTedxps 


Tagetes erecta 
DXPS 


203 ±11 


193111 


228140 


pAdtytB 


Adonis 

palaestina LYTB 


15719 


18013 


16812 


p6803!ytB 


Synechocystis 
PCC 6803 LYTB 


116111 


111 11 


10912 


pAtgapA 


Arabidopsis 
thaliana GAPD 


7916 


nd 


7913 



Data were obtained with E. coti strain TOP10 containing pAC-LYC, pAC-BETA or 
pAC-ZEAX (Cunningham et al. f 1994 and 1996, Sun etal., 1996). Carotenoid 
content (per volume of culture) was measured after 48-72 h growth in LB medium. 
Cultures were inoculated with freshly grown individual colonies from agar plates. 
Values are mean + sd for 4 to 13 individual cultures. The empty plasmid pBluescnpt 
SK* (from Stratagene) served as the control and was used as the vector for ail of the 
plant cDNAs and the Synechocystis lytB gene. The influence of GAPD on 
accumulation of b-carotene was not determined (nd). ■ 
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Having now fully described the invention, it will be apparent to one of 
ordinary skill in the art that many changes and modifications can be made thereto 
without departing from the spirit or scope of the invention as set forth herein. 
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We claim: 

1 . A method of enhancing the production of isopentenyl pyrophosphate (IPP). 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

5 inserting into the host cell a vector comprising a heterologous nucleic 

acid sequence which encodes for a protein having deoxyxylulose-5-phosphate 
(DXP) synthase enzyme activity, wherein the heterologous nucleic acid sequence 

is operably linked to a promoter, and 

expressing the heterologous nucleic acid sequence, thereby 

10 producing the protein. 

2. The method of claim 1 . wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

3. The method of claim 1 . wherein the isoprenoid comprises at least one 
15 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 

4. The method of claim 1 , wherein the isoprenoid comprises a carotenoid. 

5. The method of claim 1 , wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

20 6. The method of claim 5, wherein the nucleic acid sequence for dxps encodes 
an amino acid sequence which is completely identical to SEQ ID NO: 2. 

7. The method of claim 1, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

8. The method of claim 1 , wherein the heterologous nucleic acid sequence 
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originates from a eukaryotic cell. 

9. The method of claim 1 , wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10. A method of modifying the production of isopentenyl pyrophosphate (IPP). 
5 dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies deoxyxylulose-5- 
phosphate (DXP) synthase enzyme activity in the host cell, relative to an 
10 untransformed host cell, wherein the heterologous nucleic acid sequence is 

operably linked to a promoter, and 

expressing the heterologous nucleic acid sequence, thereby 

producing the protein. 

11. The method of claim 1 0, wherein the isoprenoid is derived from at least one 
15 member selected from the group consisting of geranyl pyrophosphate (GPP). 

famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

12. The method of claim 10. wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil. a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

20 1 3. The method of claim 1 0, wherein the isoprenoid comprises a carotenoid. 

14. The method of claim 10. wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

1 5. The method of claim 1 0, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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16. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

1 7. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having deoxyxylulose-5-phosphate (DXP) synthase 

5 enzyme activity, the method comprising inserting into the host cell a vector 
comprising the heterologous nucleic acid sequence, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter, and expressing the 
heterologous nucleic acid sequence. 

1 8. The method of claim 1 7, wherein the heterologous nucleic acid sequence 
10 comprises a nucleic acid sequence for dxps. 

19. The method of claim 18, wherein the nucleic acid sequence for dxps 
encodes an amino acid sequence which is completely identical to SEQ ID NO: 2. 

20. The method of claim 1 7, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15 21 . The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

22. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

23. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
20 which encodes for a protein which modifies deoxyxylulose-5-phosphate (DXP) 

synthase enzyme activity in the host cell, relative to an untransformed host cell, the 
method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably 
linked to a promoter, and expressing the heterologous nucleic acid sequence. 
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24. The method of claim 23, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

25. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 26. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

27. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

10 inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter, and 

expressing the heterologous nucleic acid sequence, thereby producing the 

15 protein. 

28. The method of claim 27, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

29. The method of claim 27, wherein the isoprenoid comprises at least one 
20 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 

30. The method of claim 27, wherein the isoprenoid comprises a carotenoid. 

31. The method of claim 27, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 
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32. The method of claim 31, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

33. The method of claim 27, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

5 34. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

35. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

36. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
10 dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein which modifies glyceraldehyde-3-phosphate 
(GAP) dehydrogenase enzyme activity in the host cell, relative to an untransformed 
15 host cell, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

37. The method of claim 36, wherein the isoprenoid is derived from at least one 
20 member selected from the group consisting of geranyl pyrophosphate (GPP), 

famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

38. The method of claim 36, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

25 39. The method of claim 36, wherein the isoprenoid comprises a carotenoid. 
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40. The method of claim 36, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

41. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 42. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

43. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, the method comprising inserting into the host cell 

10 a vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

44. The method of claim 43, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

15 45. The method of claim 44/ wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

46. The method of claim 43, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

47. The method of claim 43, wherein the heterologous nucleic acid sequence 
20 originates from a eukaryotic cell. 

48. The method of claim 43, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

49. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
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which encodes for a protein which modifies glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity in the host cell, relative to an untransformed host 
cell, the method comprising inserting into the host cell a vector comprising the 
heterologous nucleic acid sequence, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter, and expressing the heterologous nucleic 
acid sequence. 

50. The method of claim 49, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

51 . The method of claim 49. wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

52. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

53. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

protein. 

54. The method of claim 53. wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

55. The method of claim 53, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 
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56. The method of claim 53. wherein the isoprenoid comprises a carotenoid. 



57. The method of claim 53, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

58. The method of claim 57, wherein the nucleic acid sequence encodes an 
5 amino acid sequence which is completely identical to SEQ ID NO: 6. 

59. The method of claim 53, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

60. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

10 61. The method of claim 53. wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

62. A method of modifying the production of isopentenyl pyrophosphate (IPP). 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

1 5 inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein which modifies LYTB activity in the host cell, 
relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

20 protein. 

63. The method of claim 62, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

64. The method of claim 62, wherein the isoprenoid comprises at least one 

33 



<WQ 99S8649A1 _l_> 



WO 99/58649 PCT/US99/07D41 



member selected from the group consisting of an essential oil. a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

65. The method of claim 62, wherein the isoprenoid comprises a carotenoid. 

66. The method of claim 62, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

67. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

68. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

69. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having LYTB activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, 
wherein the heterologous nucleic acid sequence is operably linked to a promoter, 
and expressing the heterologous nucleic acid sequence. 

70. The method of claim 69. wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

71 . The method of claim 69, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 6. 

72. The method of claim 69. wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

73. The method of claim 69. wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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74. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

75. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

76. The method of claim 75, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

77. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

78. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 
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Figure 5A: SEQ ID NO: 1 ^ ^ 

Tagetes erecta (marigold) cDNA for DXP synthase 
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gatagggctg 


gtttggtagg 


agcagatggt 


ccaactcacc 


1601 


gtggtgcatt 


tgatacaaca 


ttcatggctt 


gtttaccaaa 


catggtggtt 


1651 


atggctcctt 


catgcgaggc 


agagctcatg 


aacatggtgg 


ccacagctgt 


1701 


agccatcgat 


gaccgtccta 


gctgttttag 


gtacccaaga 


ggaaatggaa . 


1751 


taggctccat 


actaccagca 


aataacaaag 


gaaccttaat 


cgaggttgga 


1801 


actggaaggg 


tgattaaaga 


gggaaataga 


gtggccttat 


egggatatgg 
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FIGURE 5A (cont.) 



1851 aactatagtt caaagttgct tagcagcaag tgaagttctt aaaaaaattg 

1901 gcatctctgt gaccgtggcg gatgcacgat tctgcaagcc gcttgatgga 

1951 aatttaatca aacaactagc aaacgagcat gaggtcctca tcacagttga 

2001 agaaggctct ataggaggat ttagctctca cgtctctcat ttcttggcct 

2051 taaatggatt acttgatgga caccttaagt ggagagcaat gacgttgcct 

2101 gatcgatata ttgaacatgg ggctcaatct gaccagattg aggaggctgg 

2151 tcttagctca aagcatattg cagcaacagt tttatcattg attgggggca 

2201 gcaaagagac actacatgca ctcaacgtat aaattgaaat atgccatgct 

2251 agatactata aaagacaatg ctttgctgga ctcttttgat catatatata 

2301 tatatatata tatatataca tatatatatt gtagttttcg atcatataca 

2351 tattgtactt ttcttgtaac atacatggtc gcgttgataa ttgttgacat 

2401 aatcccotcc 
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Figure 5B: SEQIDNO:2 7^3 

PREDICTED AMINO ACID SEQUENCE of Tagetes ere eta (marigold) cDNA for 
DXP synthase 

TRANSLATION from: 55 to: 2232 Length: 726 



1 


MALCGALKGG 


FVPIAQNGYT 


SSSLLNPSAN 


AIMPSNKRKF 


LGIVAVSKEH 


51 


ATNEHEDLTT 


MDKTTSTTLK 


YSGDKPKXPI 


LDTINYPIHM 


KNLCVEELVK 


101 


IiADELREEXV 


YTVSKTGGHL 


SSSLGWELT 


VSLEHVFNTP 


EDKIIWDVGH 


151 


QAYPHKTLTG 


RRSRMRTIRQ 


TFGLAGFPKR 


DESNHDAFGA 


GHSSTSISAG 


201 


LGMAVGRDIiL 


GKNNHVTAVT 


GDGAMTAGQA YEAMNNAGYL DSNLIIYLND 


251 


NRQFSLPTAT 


IDGPAPPVGA LSRSLTRLQT 


SQKFRQLREA AKEVTKQLGD 


301 


KTHEVAAKMD 


SLVKGMVGGQ 


GASMFEELGL 


YYVGPVDGHN 


LEDLVYVFDK 


351 


IKSMTAPGPV 


LVHIVTEKGK 


GYFPAEVAAD 


KMHGWKFDT 


QTGKQKKNKT 


401 


KTLSYTQYFV 


DSLVAEAKED 


DKIVAIHAAM 


GGGTGLNTFQ 


KEFPARCFDV 


451 


GIAEQKAITF 


AAGLATEGLK 


PFCAIYSSFI* 


QRGYDQWHD 


VDLQKLPVRF 


501 


AMDRAGLVGA 


DGPTHCGAFD 


TTFMACLPNM 


WMAPSCEAE 


LMNMVATAVA 


551 


IDDRPSCFRY 


PRGNGIGSIL 


PANNKGTLIE 


VGTGRVIKEG 


NRVALLGYGT 


€01 


IVQSCLAASE 


VLXKIGISVT 


VADARFCKPL 


DGNLIKQLAN 


EHEVLITVEE 


651 


GSIGGFSSHV 


SKFUVLNGLL 


DGHLKWRAMM 


LPDRYIHHGA 


QSDQIEEAGL 


701 


SSKHIAATVL 


S I* X GGS Kjl TLi 


HALNV* 
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Figure 6A: SEQ ID NO: 3 

Arabidopsis thaiiana cDNA for a chloroplasts isoform of glycera!dehyde-3- 
phosphate (GAP) dehydrogenase. 



1 


ctgtccccaa 


gggtttcact 


gaattctcag 


gattgegaag 


ctcctctgct 


51 


tctcttccct 


tcggcaagaa 


actttcttcc 


gatgagttcg 


tttccatcgt 


101 


ctccttccag 


acttctgcaa 


tgggaagcag 


tggtggatac 


aggaaaggtg 


151 


tgactgaagc 

W WW* 


caagcttaag gtggccatta 


atggattcgg 


taggateggg 


201 


aagaacttcc 


tgagatgttg gcatggtcgc 


aaggactctc 


ctcttgatat 


251 


cattgccatt 


aatgacactg gtggcgtcaa 


gcaggcttcg 


catttactta 


301 


aatacgactc 


tactctcgga 


atctttgatg 


ctgatgtcaa 


accttctgga 


351 


qaqactgcaa 


tctctgttga 


tggaaagatc 


atccaagctg 


tctctaaccg 


* 

401 


aaacccgtct 


cttctccctt 


ggaaggagct 


aggaattgac 


attgtcatcg 


451 


aaggaaccgg 


agtgrttg-g 


gatagagaag - gtgcagcgaa acacat tgaa 


501 


qctactgcca 

w — * w w 


agaacgtcat 


cattactget 


ccaggcaaag 


gagatattcc 


551 


aacttatgtc 




atgcagatgc 


ttacagtcat 


gatcaaccta 


601 


tcatcagcaa 


tgcatcrtgc 


actaccaact 


gtcttgcccc 


C t £ wCf t C3.3Lcl 


651 


gcccttgacc 

w 


agaaaticcgg 


tatcataaag 


ggtacaatga 


cgactactca 


701 


ctcttacacc 


ggcgaccaga 


ggcegctaga 


cgcgagccac 


cgtzgatctaa 


751 


ccaQaocaao 


accaccccct 

arf 


ttgaacattg 


ttcctacrcc 


tacacgagca 


801 


gctaaagctg 


t cere wtlw ci w 


gcrccctaac 


ctcaaagcaa 


aactcaaegg 


851 


gatcgctctc 


cgtgraccaa 


caccaaacgt 


atcagtggtt 


gatcrcgttg 


901 


tgcaggtctc 


aaagaagaca 


tttgecgagg 


aagtcaaege 


tgctttcaga 


951 


gattctgcag 


agaaagagct 


taaaggcata 


ctcgatgtct 


gegatgagee 


1001 


actagtgtcc 


gctgarttca gatgetcaga 


tttttcaacg 


accattgatt 


1051 


catcactcac 


tatggctatg 


ggagatgata 


tggttaaggt 


gattgettgg 


1101 


tatgataatg 


aatggcgctia 


ctcacagaga gttgttgact 


tggctgacat 


1151 


tgttgccaac 


aactggaagc 


gatttcatca 


aactttgttt 


catgtttttc 


1201 


catctttttc 


tctttctctt 


ttttacatta 


tggttctcaa 


aatatcggtg 


1251 


aagactgtat 


a 
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Figure 6B: SEQIDNO:4 gy-jj 

Predicted amino acid sequence of Arabidopsis thaRana cDNA for a chloroplasts 
isoform of glyceraldehyde-3-phosphate (GAP) dehydrogenase. 

1 VPKGFTEFSG LRSSSASLPF GKKLSSDEFV SIVSFQTSAM GSSGGYRKGV 

51 TEAKLKVAIN GFGRIGRNFL RCWHGRKDSP LDIIAINDTG GVKQASHLLK 

101 YDSTLGIFDA DVKPSGETAI SVDGKIIQW SNRNPSLLPW KELGXDIVTE 

151 GTGVFVDREG AGKHIEAGAK KVTITAPGKG DIPTYWGVN ADAYSHDEPI 

201 ISNASCTTNC LAPFVKVLDQ KFGIIKGTMT TTHSYTGDQR LLDASKRDLR 

251 RAEAAALHIV PTSTGAAKAV ALVLPNLKGK LNGIALRVPT PNVSWDLW 

301 QV5KKTFAEE VNAAFRDSAE KELKGILDVC DEPLVSVDFR CSDFSTTIDS 

351 SLTMVMGDDM VKVIAWYDNE WGYSQRWDL ADIVAKNWK* FHQTLFHVFP 

401 FFSLSLFYIM VLKISVKTV 



* 
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Figure 7A: SEQ ID NO: 5 
Adonis palaestina cDNA 
Length: 1619 June 4, 1998 



10/13 



i 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 



ccttcgtctc tgcacttctt tcttccatgg cgacttctct tcaattctgc 
cgtttctcca ccccatctga tctctccttc cctgaaaccc gctcttccac 
tcgcctctac cgatccaaga aaccattttc cgtccgttgt cacagtgaag 
gaccttcagg ctcctcctcc acggcagtgg aatccgaatt cgatgccaag 
tctttccgcc acaaccttac aagaagcaag aactacaacc gaagaggttt 
cggccacaaa gatgagactc ttgagcttat gaacagcgag tatacaagtg 
atgttattaa gaagttgaag gagaatggaa acgagtacag ttggggacct 
gttacagtga aattggctga gtcgtatggg ttctgttggg gtgtggaaag 
agctgttcag attgcttatg aagctagaaa acagtttccc gatgagaaga 
tttggattac taatgagatt atccataatc caactgttaa caagagatta 
gaagagatgg aagttaaaga gattccagtt ggggatggga agaaacattt 
tgatgttgtt gcaaaggctg acgctgtgat tttgccggct tttggagctg 
ccgtaagtga aatgcrgact tzgagcgaca aaaatgtcca gatagtcgac 
acaacttgcc catgcgtatc taacctgtgg aattccgttg agaagcataa 
gaagggagag tacacaccaa tcattcacgg taaatattcc cacgaggaaa 
ctattgcaac tgcatcttct gctgggaagt acatcattgc gaagaacatg 
gatgaggcaa tgtatgtgcg cgaczacacz cttggaggtg agctcaatgg 
atccagctca gataaacagg cactcctgca gaaattcaaa tacgcaattt 
ccgaaggatt tgatccagac acacatctta ttaaaacagg cattgcaaac 
caaactacaa tgttgaaggg agaaacagaa gatattggta' aacttctcga 
aaagactatg atgcgcaagt atggagttga gaatattaat gaccacttca 
taagctttaa cacaatatgt gatgctacac aacagagaca agacgccatg 
tttaagttgg tagaggagaa agtggatctc atattagcag ttggtggatg 
gaattcaagt aacacctcac atctacaaga aatttctgag ttacgtggca 
ttccttctta ctggattgac agtgagacga gaataggacc aggaaacaag 
attagccaca aattgaatca tggtgagcta gttgagaccg aaaactggct 
cccagagggt cctgttacaa ttggtgtgac atcaggtgct tctaccccag 



ataaggctgt tgaagatgct cttgtaaagg 

caattgttgc agctagcata gagatcagtg 

caccactgtt caatatgtgt gattgtatat 

atatagaaac tgtaatacct tgttatgtta 



tgttcgacat caagcgcgaa 
ttctgcagag atcaaagaac 
gtatcatgga gaacacaggg 
tactatatgg atgtccttta 



ccataatgat gtaaattttt agttccatgc aacaatatt 



t tc 
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Figure 7B: SEQIDNO:6 11/13 

Predicted amino acid sequence of Adonis palaestina LYTB 

Length: 465 Translate 27-1421 

1 MATSLQFCRF STPSDLSFPE TRSSTRLYRS KKPFSVRCHS EGPSGSSSTA 

SI VESEFDAKSF RHNLTRSKNY NRRGFGHKDE TLELMNSEYT SDVIKKLKEH 

101 GNEYSWGPVT VKLAESYGFC WGVERAVQIA YEARKQFPDE KIWITNEIIH 

151 NPTVNKRLEE MEVKEIPVGD GKKHFDWAK GDWILPAFG AAVSEMLTLS 

201 EKNVQIVDTT CPWVSKVWNS VEKHKKGEYT SIIHGKYSHE ETIATASFAG 

251 KYI IVKNMDE AMYVCDYILG GELNGSSSDK QALLEKFKYA ISEGFDPDTD 

301 LIKTGIANQT TMLKGETEDI GKLLEKTMMR KYGVENINDH FISFNTICDA 

351 TQERQDAMFK LVEEKVDLIL WGGWNSSNT SHLQEISELR GIPSYWIDSE 

401 TRIGPGNKIS HKLNHGELVE TENWLPEGPV TIGVTSGAST PDKAVEDALV 

451 KVFDIKREQL LQIA* 
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FIGURE 8 
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FIGURE 9 
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